Unsupervised Ontology Acquisition from Plain Texts: The OntoGain System
Abstract
We propose OntoGain, a system for unsupervised ontology acquisition from unstructured text that relies on multi-word term extraction. To acquire taxonomic relations, we exploit the lexical information inherent in multi-word terms in a comparative implementation of agglomerative hierarchical clustering and formal concept analysis. To detect non-taxonomic relations, we compare an association-rule-based algorithm with a probabilistic algorithm. OntoGain can transform the derived ontology into standard OWL statements. Its results are compared both to hand-crafted ontologies and to a state-of-the-art system in two domains: medicine and computer science.
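As a rough illustration of how lexical information in multi-word terms can drive agglomerative clustering, the sketch below groups terms by the words they share. It is not OntoGain's implementation: the Jaccard word-overlap measure, the average-link criterion, the `threshold` value, and all function names are assumptions chosen for the example.

```python
from itertools import combinations


def word_set(term):
    """Split a multi-word term into its lowercase constituent words."""
    return frozenset(term.lower().split())


def cluster_similarity(a, b):
    """Average-link Jaccard word overlap between two clusters of terms."""
    scores = []
    for t1 in a:
        for t2 in b:
            w1, w2 = word_set(t1), word_set(t2)
            scores.append(len(w1 & w2) / len(w1 | w2))
    return sum(scores) / len(scores)


def agglomerate(terms, threshold=0.2):
    """Repeatedly merge the two most similar clusters.

    Stops when no pair of clusters reaches `threshold` (an assumed cut-off).
    Returns the final clusters and the ordered list of merges.
    """
    clusters = [(t,) for t in terms]  # start with one cluster per term
    merges = []
    while len(clusters) > 1:
        # Pick the most similar pair of clusters in the current partition.
        best_score, i, j = max(
            (cluster_similarity(clusters[i], clusters[j]), i, j)
            for i, j in combinations(range(len(clusters)), 2)
        )
        if best_score < threshold:
            break
        merged = clusters[i] + clusters[j]
        merges.append((clusters[i], clusters[j], best_score))
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    return clusters, merges
```

Called on "neural network", "artificial neural network", "hash table", and "distributed hash table", this sketch groups the two network terms together and the two hash-table terms together, since each pair shares its head words while the two groups share none.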
Similar resources
Effective Ontology Learning: Concepts' Hierarchy Building using Plain Text Wikipedia
Ontologies are at the heart of the Semantic Web. Nevertheless, engineering heavyweight or formal ontologies is commonly judged to be a difficult exercise that demands considerable time and cost. Ontology learning is thus a response to this need and an approach to the ‘knowledge acquisition bottleneck’. Since texts are massively available everywhere, making up of experts’ knowledge and th...
Designing an Intelligent Ontology Construction System Using the ART Neural Network and the C-value Method
In recent years, many efforts have been made to design ontology learning methods and automate the ontology construction process. Ontology construction is time-consuming and costly in almost all domains and applications, so automating it is one way to overcome the knowledge acquisition bottleneck in information systems and reduce construction cost. In this artic...
The GENIA Project: Knowledge Acquisition from Biology Texts
Overview of Project The GENIA project [9] (Fig. 1) seeks to automatically extract useful information from texts written by scientists to help overcome the problems caused by information overload. We intend that while the methods are customized for application in the microbiology domain, the basic methods should be generalisable to knowledge acquisition in other scientific and engineering domain...
Automatic Rule Retrieval from Websites Using Ontology and Text Mining
A rule-based system such as an intelligent service-comparison portal may compare product prices, shipping options, refund options, and so on. Such a rule-based system requires an automatic procedure for acquiring knowledge from the Web, which consists of unstructured texts. Knowledge acquisition can be carried out by ontology acquisition and rule acquisition. Obtaining information such as product prices from w...
Acquisition of Semantic Knowledge using Machine learning methods: The System
In this paper we describe the ML system ASIUM, which acquires semantic knowledge from parsed technical texts. ASIUM is devoted to the acquisition of case frames and ontologies. Applications requiring case frames and ontologies are numerous. The Dassault Aviation company, with which we are collaborating, is mainly interested in controlling the semantics of specification texts, in terminology acquisition for s...